Industrial Scene Text Detection With Refined Feature-Attentive Network
نویسندگان
چکیده
Detecting the marking characters of industrial metal parts remains challenging due to low visual contrast, uneven illumination, corroded character structures, and cluttered background part images. Affected by these factors, bounding boxes generated most existing methods locate low-contrast text areas inaccurately. In this paper, we propose a refined feature-attentive network (RFN) solve inaccurate localization problem. Specifically, design parallel feature integration mechanism construct an adaptive representation from multi-resolution features, which enhances perception multi-scale texts at each scale-specific level generate high-quality attention map. Then, attentive refinement is developed map rectify location deviation candidate boxes. addition, re-scoring designed select with best rectified location. Moreover, two scene datasets, including total 102156 images 1948809 instances various structures parts. Extensive experiments on our dataset four public datasets demonstrate that proposed method achieves state-of-the-art performance.
منابع مشابه
Feature Enhancement Network: A Refined Scene Text Detector
In this paper, we propose a refined scene text detector with a novel Feature Enhancement Network (FEN) for Region Proposal and Text Detection Refinement. Retrospectively, both region proposal with only 3 × 3 sliding-window feature and text detection refinement with single scale high level feature are insufficient, especially for smaller scene text. Therefore, we design a new FEN network with ta...
متن کاملPerspective Scene Text Recognition with Feature Compression and Ranking
In this paper we propose a novel character representation for scene text recognition. In order to recognize each individual character, we first adopt a bag-of-words approach, in which the rotation-invariant circular Fourier-HOG features are densely extracted from an individual character and compressed into middle level features. Then we train a set of two-class linear Support Vector Machines in...
متن کاملRobust Scene Text Detection with Convolution Neural Network Induced MSER Trees
Maximally Stable Extremal Regions (MSERs) have achieved great success in scene text detection. However, this low-level pixel operation inherently limits its capability for handling complex text information efficiently (e. g. connections between text or background components), leading to the difficulty in distinguishing texts from background components. In this paper, we propose a novel framewor...
متن کاملScene Text Area Detection from Video
Text detection from videos is a well known research area. Especially the detection of static superimposed text such as captions has been researched successfully, but makes many assumptions that question the applicability of those algorithms for moving scene text. In this dissertation, I propose a scene text area detection approach that includes a simple key frame extraction, feature extraction,...
متن کاملText Detection in Indoor/Outdoor Scene Images
In this paper, we propose a novel methodology for text detection in indoor/outdoor scene images. The proposed methodology is based on an efficient binarization and enhancement technique followed by a suitable connected component analysis procedure. Image binarization successfully process indoor/ outdoor scene images having shadows, non-uniform illumination, low contrast and large signal-depende...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology
سال: 2022
ISSN: ['1051-8215', '1558-2205']
DOI: https://doi.org/10.1109/tcsvt.2022.3156390